CDS
Accession Number | TCMCG004C48127 |
gbkey | CDS |
Protein Id | XP_025626904.1 |
Location | join(11439105..11439184,11439603..11439961,11440853..11440922,11443950..11444065,11444593..11444624,11444728..11444853,11447010..11447168,11452582..11452674,11452762..11452929,11453659..11453754,11454226..11454461,11454551..11454608,11454971..11455059,11456254..11456385,11456585..11456707,11457201..11457299,11457424..11457612,11457969..11458143,11458275..11458431,11458571..11458770,11461477..11461618,11462587..11462747,11462961..11463077,11463716..11463775) |
Gene | LOC112720249 |
GeneID | 112720249 |
Organism | Arachis hypogaea |
Protein
Length | 1078aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA476953 |
db_source | XM_025771119.2 |
Definition | nuclear pore complex protein NUP107 isoform X1 [Arachis hypogaea] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGGATGACGGAATGGACACCTCTCCCAGCATCTTCGATCCTCAGAACCTCTCCACTAGACAGAAGTTTCGCAGATACGGGAAGAGGCACTCAACTTCTGGTGCTTCAATCCATCAAGATAACTCAGCTTCCAAGTTGAGTGAAACTGGGCTTTTATATGATGGCCAAAGTATCCACAGCCCTACTAATGCTGCACTTCTTCTTGAAAACATTAAACAAGAGGTTGAGGGTCTTGATGCTGAATACTATGAAGAAAAAATACAACCTTCCTCTAAAAGGATGCTGTCTTCTGATATTCAAGGAATCCCTGTTGTTGATGCTGGTTTTGACTCTATACGCCACTCATTAAAAGCTTGTAAACAAGATGGTGACTCATTGGGAGATGGTGCAGAAACGATTTTTACTTTATTTGGATCTCTGCTTGACTGTGCTATGCAAGGATTGATGCCTGTTTCTGATCTGATACTACGTTTTGAGAATGCATGTCGAGATGTTTCAGAATCTATCAGGTATGGTTTGAATGTTAAGCATCGAGTTGTTGAGGACAAATTGATGAGGCAGAAGGCTCAGCTCTTGCTTGATGAGGCTGCAACGTGGTCTTTGCTGTGGTTCGTTTATGGGAAAGTGACTGAAGAACTATCTAAAGAGCAAATACCGGTGTCAGGAACCTCCCATGCTGTGGCTTGTGAGTTTGTTTCAGAAGACCATACTGCTCAATTATGCCTTCGGATAGTCCAGTGGTTGGAGGGTTTAGCTTCAAAAGCTCTTGACTTGGAAGAAAAGGTGCGTGGATCTCATGTTGGTAGTTATCTTCCTAGTTCTGGTGTTTGGCATCATACTCAACGTTACTTAAAGAAAGAGAGAGCTGATATGAACATTGTTCATCACTTGGATTTTGATGCTCCGACTCGTGAAAATGCAAACCTATTGCCTGATGACATGAAACAAGATGAATCTCTTTTAGAAGATGTTTGGACTCTTTTAAGGGCTGGAAGACTAGAAGAGGCATCTGGACTTTGTCATTCTGCTGGACAGCCATGGAGAGCTGCCTCTCTGTGTCCATTTGGAGGTTTGAACCTGTTTCCTTCAGTTGAAGCTCTGGTGAAGAATGGTAAAAGTAGAACATTGCAGGCTGTTGAATTTGAAAGTGGCATTGGTCATCAGTGGCATCTTTGGAAATGGGCTTCATATTGTGCATCAGAGAAAATATCAGAGCTTGGTGGGAAATTTGAAGCTGCTGTATATGCAGTCCAATGTAGCAACTTAAAACGGATGCTTCCATTGTGTACAGATTGGGAGTCAGCATGCTGGGCATTGGCGAAGTCTTGGCTTGATGTGCAGGTAGACTTGGAAATCACACGTTCGCTGCCTGGTGGAATTGATCAACTTAGATCTTTTGGTGATGTAATTAATGGAAGCCCAGGACACGCTGATGGCTCCTTGGATCCCACAGATGGGCCCGAAAATTGGCCTATTCAAGTTTTGAATCAGCAACCACGACAACTTTCATCTCTTCTTCAGAAGCTACATTCAGGTGAAATGATGCATGAGGCTGTAACTCAACAATGCAAGGAGCAACACCGACAAATTCAGATGACTTTAATGCAAGGTGATATACCACGTGTACTGGACCTTATATGGTCATGGATAGCACCATCAGAAAATGATCAGAATATATTTAGGCCTCATGGAGATTCTCAGATGATACGATTTGGTGCACATCTAGTCCTCGTGCTGAGATATTTACTTGCTGAGGAAATGAAAGATACCTTTAGAGACAAGATTCTTAGCGTTGGTGATAACATTTTGCACATGTATGCACTGTTTCTCTTTTCAAAGGAGCACGAGGAGCTGGTTGGCATATATGCTTCTCAGCTTGCATGTCACCGTTGTATTGACCTCTTTGTGCACATGATGGAACTCAGGCTAGACAGCAGTGTACATGTCAAATACAAGATCTTCCTTTCTGCCATTGAGTATTTACCATTTTCCTCCGAGCATGATTCGACGGGCAATTTTGAAGATATTATAGAGAGAATTTTATTGAGATCTCGGGAGATCAAGGCTGGTGAATATGCTGACCTGTCAGATGTTGCAGAGCAGCACAGACTGCAAAGTCTTCAGAAAGCCAAAGCCATTCAATGGCTTTGCTTTACACCACCATCAACAATTCCTAATTTCCAAGATGTTAGTAAAAGATTACTTATCCGAGCATTAACACACAGCAACATACTCTTCAGGGAGTTTGCTCTAATTTCGATGTGGAGAGTACCAGCAATGCCTATAGGTGCGCACACAGCACTTGGTTTTCTCGCTGAGCCCTTGAAACAGCTCTCCGAAACTCCGGAGACGTCGGAAGATGATATTGTTTTTGAGCATCTGAGGGAGTTCCAAGACTGGCGTGAATATTATTCCTGTGATGCAACCTACCGCAATTGGCTCAAACTTGAACTAGAGAATGCAGAAGTTCCTGCCTCTGACCTGTCGTTAGAGGAAAAGAAGAGGGCCATTTCAACAGCAGAGGAAATGCTGACAGCATCTCTTTCACTACTAGAAAGACAAGAAACCCCTTGGCTGGCTTCTATTAACGATGGCTATGAATCGGCTGAACCTGTTTACCTTGAACTTCATGCCACTTCAATGCTATGCTTGCCATCTGGAGATTGTTTGTGTCCAGATGCTACTGTGTGCACTACCCTGATGAGTGCCCTTTACTCATCAGTTGGTCATGAGGTTATCTTAAGCCGACAACTAATGGTGAATGTCTCCATATCTTCAAGGGACAAGTATTGCATTGATGTTGTTCTCCGTTGCTTAGCAATAGCTGGTGATGGACTTGGACCACACAATCTCAATGATGGTGGTATTCTTGGAACAATTATGGCTGCAGGTTTTAAAGGTGAGCTTCCTCGGTTTCAATCTGGGGTAACGTTGGAAATATCCAGATTGGATGCTTGGTACTCCAATAAAGATGGAACCATAGAATACCCAGCAACCTACATTGTGAAAGGACTTTGCCGTAGATGCTGTCTCCCTGAAATCATTCTCCGTTGTATGCAAGTTTCTGTCTCTCTCATGGGATCAGGAGTCATGCCTGATTGCCACGATCGATTGATTGAAATGGTTGGCAGCCCCGAAACTAAGTTTCTTCACTTATTTAGTCAACAACAATTACAGGAGTTTCTATTGTTTGAGAGGGAGTACTCAATCTGCAGAATGGAGCTTACTGAGGTATAA |
Protein: MDDGMDTSPSIFDPQNLSTRQKFRRYGKRHSTSGASIHQDNSASKLSETGLLYDGQSIHSPTNAALLLENIKQEVEGLDAEYYEEKIQPSSKRMLSSDIQGIPVVDAGFDSIRHSLKACKQDGDSLGDGAETIFTLFGSLLDCAMQGLMPVSDLILRFENACRDVSESIRYGLNVKHRVVEDKLMRQKAQLLLDEAATWSLLWFVYGKVTEELSKEQIPVSGTSHAVACEFVSEDHTAQLCLRIVQWLEGLASKALDLEEKVRGSHVGSYLPSSGVWHHTQRYLKKERADMNIVHHLDFDAPTRENANLLPDDMKQDESLLEDVWTLLRAGRLEEASGLCHSAGQPWRAASLCPFGGLNLFPSVEALVKNGKSRTLQAVEFESGIGHQWHLWKWASYCASEKISELGGKFEAAVYAVQCSNLKRMLPLCTDWESACWALAKSWLDVQVDLEITRSLPGGIDQLRSFGDVINGSPGHADGSLDPTDGPENWPIQVLNQQPRQLSSLLQKLHSGEMMHEAVTQQCKEQHRQIQMTLMQGDIPRVLDLIWSWIAPSENDQNIFRPHGDSQMIRFGAHLVLVLRYLLAEEMKDTFRDKILSVGDNILHMYALFLFSKEHEELVGIYASQLACHRCIDLFVHMMELRLDSSVHVKYKIFLSAIEYLPFSSEHDSTGNFEDIIERILLRSREIKAGEYADLSDVAEQHRLQSLQKAKAIQWLCFTPPSTIPNFQDVSKRLLIRALTHSNILFREFALISMWRVPAMPIGAHTALGFLAEPLKQLSETPETSEDDIVFEHLREFQDWREYYSCDATYRNWLKLELENAEVPASDLSLEEKKRAISTAEEMLTASLSLLERQETPWLASINDGYESAEPVYLELHATSMLCLPSGDCLCPDATVCTTLMSALYSSVGHEVILSRQLMVNVSISSRDKYCIDVVLRCLAIAGDGLGPHNLNDGGILGTIMAAGFKGELPRFQSGVTLEISRLDAWYSNKDGTIEYPATYIVKGLCRRCCLPEIILRCMQVSVSLMGSGVMPDCHDRLIEMVGSPETKFLHLFSQQQLQEFLLFEREYSICRMELTEV |